Audio Pitch Shifting Using the Constant-Q Transform

Author

  • CHRISTIAN SCHÖRKHUBER
Abstract

Pitch shifting of polyphonic music is usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are based on the Fourier transform, although its linear frequency bin spacing is known to be inadequate to some degree for analyzing and processing music signals. Recently, invertible constant-Q transforms (CQT) featuring high Q-factors have been proposed, which exhibit a more suitable geometric bin spacing. In this paper, a frequency-domain pitch shifting approach based on the CQT is proposed. The CQT is particularly attractive for pitch shifting because the shift can be implemented by frequency translation (shifting partials along the frequency axis), as opposed to spectral stretching in the Fourier transform domain. Furthermore, the high time resolution of the CQT at high frequencies improves transient preservation. Audio examples are provided to illustrate the results achieved with the proposed method.
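
The frequency-translation idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a single magnitude frame on a geometric bin grid and shifts by a whole number of bins, leaving out the phase handling and the inverse transform that a real pitch shifter needs. The function names `cqt_center_frequencies` and `translate_bins`, and the values 55 Hz, 12 bins per octave and 48 bins, are hypothetical choices made for illustration.

```python
import numpy as np

def cqt_center_frequencies(f_min, bins_per_octave, n_bins):
    """Geometrically spaced center frequencies: f_k = f_min * 2**(k / B)."""
    k = np.arange(n_bins)
    return f_min * 2.0 ** (k / bins_per_octave)

def translate_bins(spectrum, semitones, bins_per_octave):
    """Shift a frame along the log-frequency axis by a whole number of bins.

    Bins moved past either edge are dropped; vacated bins are set to zero.
    """
    shift = int(round(semitones * bins_per_octave / 12))
    out = np.zeros_like(spectrum)
    if shift > 0:
        out[shift:] = spectrum[:-shift]
    elif shift < 0:
        out[:shift] = spectrum[-shift:]
    else:
        out[:] = spectrum
    return out

# Illustrative frame (assumed values): two partials at 110 Hz and 220 Hz.
freqs = cqt_center_frequencies(55.0, 12, 48)
source_bins = np.array([12, 24])
frame = np.zeros(48)
frame[source_bins] = 1.0

# Shift up by two semitones: every partial moves by the same number of bins.
shifted = translate_bins(frame, 2, 12)
peaks = np.flatnonzero(shifted)
ratios = freqs[peaks] / freqs[source_bins]
```

Because the bins are spaced geometrically, moving both partials by the same bin offset scales both frequencies by the same factor, 2^(2/12), so their harmonic relationship is preserved. On a linearly spaced Fourier grid the same scaling would move each partial by a different number of bins, which is the spectral stretching the abstract refers to.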


Similar resources

Pitch Shifting of Audio Signals Using the Constant-q Transform

Pitch-scale modifications of polyphonic music are usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are thereby based on the Fourier transform although its linear frequency bin spacing is known to be inadequate to some degree for analysing and processing music signals. Recently invertible constant-Q transforms (CQT) fea...


Sliding with a Constant Q

The linear frequency (constant-bandwidth) scale of the FFT has long been recognised as a disadvantage for audio processing. Long analysis windows are required for adequate low-frequency resolution, while small windows offer lower latency, better handling of transients, and reduced computation cost. A constant-Q form of analysis offers the possibility of increased low-frequency resolution for a ...


Cq-profiles for Key Finding in Audio

Key finding in audio is based on the constant Q transform. A heuristic is suggested for compressing the constant Q transform into a 12-dimensional short-term pitch class profile. Short-term profiles are weighted by a cosine window and summed, yielding long-term profiles. The latter are matched against averaged major and minor prototype profiles.


An Artistic Technique for Audio-to-video Translation on a Music Perception Study

The paper presents an audio-to-visual instrument that allows sound-to-image transformation based on an empirical investigation of the relationship between four auditory parameters – pitch, amplitude, timbre, and duration and four visual parameters – color, location, shape, and size in the multimedia context. Implementing the audio-to-visual instruments involves real-time sound analysis by using...


A real-time variable-q non-stationary Gabor transform for pitch shifting

This paper proposes a real-time variable-Q non-stationary Gabor transform (VQ-NSGT) system for speech pitch shifting. The system allows for time-frequency representations of speech on variable-Q (VQ) with perfect reconstruction and computational efficiency. The proposed VQ-NSGT phase vocoder can be used for pitch shifting by simple frequency translation (transposing partials along the frequency...



Publication date: 2013